Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.
نویسندگان
چکیده
This paper presents a novel approach to profile-profile comparison. The method compares two input profiles (like those that are generated by PSI-BLAST) and assigns a similarity score to assess their statistical similarity. Our profile-profile comparison tool, which allows for gaps, can be used to detect weak similarities between protein families. It has also been optimized to produce alignments that are in very good agreement with structural alignments. Tests show that the profile-profile alignments are indeed highly correlated with similarities between secondary structure elements and tertiary structure. Exhaustive evaluations show that our method is significantly more sensitive in detecting distant homologies than the popular profile-based search programs PSI-BLAST and IMPALA. The relative improvement is the same order of magnitude as the improvement of PSI-BLAST relative to BLAST. Our new tool often detects similarities that fall within the twilight zone of sequence similarity.
منابع مشابه
Profile-Profile Comparison Based on Hidden Markov Model Profiles
The detection of sequence similarity within the twilight zone has been a challenging problem in sequence analysis. Among various types of approaches for sequence similarity detection, the profileprofile comparison is one of the most reliable approaches [4]. In this study, we design a profile, named match-node profile, which is generated from match nodes in the Hidden Markov Model (HMM) profiles.
متن کاملComparison of structure-based and threading-based approaches to protein functional annotation.
To exploit the vast amount of sequence information provided by the Genomic revolution, the biological function of these sequences must be identified. As a practical matter, this is often accomplished by functional inference. Purely sequence-based approaches, particularly in the "twilight zone" of low sequence similarity levels, are complicated by many factors. For proteins, structure-based tech...
متن کاملImproving the quality of twilight-zone alignments.
Several recent publications illustrated advantages of using sequence profiles in recognizing distant homologies between proteins. At the same time, the practical usefulness of distant homology recognition depends not only on the sensitivity of the algorithm, but also on the quality of the alignment between a prediction target and the template from the database of known proteins. Here, we study ...
متن کاملComparison of three types of G × E performance plot for showing and interpreting genotypes’ stability and adaptability
A G × E performance (interaction, profile) plot for showing genotype-by-environment data is discussed. Three versions of such a plot are compared: the regular performance plot; the performance plot based on coded data (environment-centered performance plot), in which the environment means of a trait are subtracted from data; and the performance plot based on data standardized in environments (e...
متن کاملCoupled Eulerian-Lagrangian (CEL) Modeling of Material Flow in Dissimilar Friction Stir Welding of Aluminum Alloys
In this work, the finite element simulation of dissimilar friction stir welding process is investigated. The welded materials are AA 6061-T6 and AA 7075-T6 aluminum alloys. For this purpose, a 3D coupled thermo-mechanical finite element model is developed according to the Coupled Eulerian-Lagrangian (CEL) method. The CEL method has the advantages of both Lagrangian and Eulerian approaches, whic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of molecular biology
دوره 315 5 شماره
صفحات -
تاریخ انتشار 2002